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§J ■ ABSTRACT 

We present the results of an analysis of the rate of evolution of the Ly-a absorption 
lines in the redshift interval 0.0 to ~1.5 based upon a sample of 987 Ly-a absorption 
lines identified in the spectra of 63 QSOs obtained with the Faint Object Spectrograph 
(FOS) of the Hubble Space Telescope (HST). These spectra were obtained as part of 
the QSO Absorption Line Survey, an HST Key Project during the first four years of 
\ observations with the telescope. Fits to the evolution of the number of absorbers per 

unit redshift (dN/dz) of the form dN/dz = A x (1 + z) 7 continue to yield values of 7 
■ in the range 0.1 to 0.3, decidedly flatter than results from groundbased data pertaining 

to the redshift range z > 1.7. These results are consistent with our previous results 
based on a much smaller sample of lines, but the uncertainties in the fit have been 
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greatly reduced. The combination of the HST and groundbased data suggest a marked 
transition in the rate of evolution of the Ly-a lines at a redshift of about 1.7. The 19 
Ly-a lines from an additional higher redshift QSO from our sample for which tentative 
line identifications are available (UM 18, z em = 1.89), support the suggestion of a rapid 
increase at around this redshift. 

We derive the cumulative distribution of the full sample of Ly-a lines and show 
that the distribution in redshift can indeed be well represented by a power law of 
the form (1 + z) 7 . For this same sample, the distribution of equivalent widths of 
the Ly-a absorbers above a rest equivalent width of 0.1 A is fit quite well by an 
exponential. 

Comparing samples of Ly-a lines, one set of which has redshifts the same as, or 
very near to, the redshifts of ions from heavy elements, with another set in which no 
ions from heavy elements have been identified, we find that the Ly-a systems with 
heavy element detections have a much steeper slope than the high rest equivalent 
width portion of the Lyman-only sample. We argue that this result is not likely to 
be due to either line misidentification or incomplete spectral coverage. Considering 
the insensitivity of the equivalent width to large changes in the column density for 
saturated lines, we suggest that this result is probably attributable to rapid evolution 
of the very highest column density systems, rather than real differences in metallicity. 
We find evidence that the rate of evolution increases with increasing equivalent width. 

We compare our results for the variation of line density with redshift to recent 
numerical simulations of Ly-a absorbers, in particular, to those of Riediger et al. 199S 
which extend to zero redshift. We find fairly good agreement between these simulations 
and our results though the rapid evolution we find in the Ly-a systems containing 
heavy element ions is not predicted in their models. We speculate that these heavy 
element containing Ly-a systems involve those clouds closely associated with galaxies, 
whose column densities are too high and whose sizes are too small to be included in 
the Riediger et al. simulations. 

Our results for Ly-a lines at the high end of our equivalent width distribution are 



compatible with the recent analysis of the absorber-galaxy correlation by Chen et al 



1998. For the weaker lines however, our results suggest that whatever association 



exists between absorbers and galaxies is different from that for the stronger lines. We 
conclude with some suggestions for further observations. 



Subject headings: cosmology: observations — intergalactic medium — Quasars: 
absorption lines 
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1. Introduction 

The Hubble Space Telescope QSO Absorption Line Survey, a Key Project during the first four 
years of observations with the telescope, has resulted in the acquisition of ultraviolet spectra of a 
sample of 92 bright QSOs. These observations have yielded valuable information on the properties 
of absorption systems at low redshift. In particular, they have allowed preliminary estimates to be 
made of the rate of evolution of the Ly-a absorption systems. Discussions of our past results based 
on analysis of about 10% of the Ly-a absorbers in our data set can be found in our previous two 



'catalog papers" (Bahcall et al. 1993, Bahcall et al. 1996 ; hereafter CAT1 and CAT2). We have 



recently completed a significant expansion of the database available for analysis of the evolution 
of the Ly-a lines both in terms of the total number of lines available and in the range of redshift 
covered by the sample. This new database is described in the third of the catalog papers ( Jannuzi 



et al. 1998a| ; hereafter CAT3). We refer hereafter to the data set resulting from these 3 papers 



as the Catalog. This new database makes it appropriate to describe in more detail the results, 
methodology, and limitations of our analysis of the evolution of the low redshift Ly-a absorption 
systems, which is the purpose of the present paper. 

In two separate publications we will discuss the correlation properties of the Ly-a absorption 
systems flJannuzi et al. 1998b ) and the inferences concerning the ionizing background radiation 



which can be drawn through the "proximity effect" ( |Lu et al. 1998a| ). 

During the past few years there has been a maturing of attempts to understand the nature 
of the Ly-a absorption clouds, evolving from rather simple idealized models (e.g. pressure- 
confined clouds, gravitationally confined "mini-halos" , freely expanding clouds, caustics etc.) to 
sophisticated numerical simulations which are able to reproduce many of the features of the Ly-a 
absorption in the context of the evolution of structure. 

The spatial and mass resolution in current simulations is still too coarse and the processes 
involving star formation and the details of galaxy formation and evolution still too poorly 
understood to allow detailed predictions of absorbers on sub-galactic scales within galaxies. 
Moreover, simulations incorporating hydrodynamics are still so time-consuming that such 
simulations are just now being extended to zero redshifts. However, we are aware of one simulation 
which has been carried out to a redshift of zero (but without a full hydrodynamic treatment) at 
the level of detail and resolution so as to allow useful comparisons to be made between the data 
and the simulations ( [Ricdigcr et al. 1998j .) Continued rapid increases in computing power should 
allow detailed comparisons between full hydrodynamic simulations and observations near zero 
redshift in the near future. (A comprehensive list of references to simulations and a discussion of 
some of the issues involved in the simulations may be found in Ricdiger et al. 1998j .) 



Among the obvious and crucial areas for comparison are the distributions of line density as 
a function of redshift and rest equivalent width, and indeed this was one of the main initial goals 
of the QSO Absorption Line Key Project. We hope that the present results, in conjunction with 
the wealth of groundbased data at higher redshifts, will serve as a starting point for such detailed 
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comparisons. 

The organization of the remainder of this paper is as follows: In §2 we describe the subset 
of QSOs taken from the Catalog which are used in the present analysis. In §3 we discuss the 
applicability of "line counting" and maximum likelihood techniques to the analysis of the evolution 
of the Ly-a lines and describe the various parameters characterizing subsamples of the Ly-a lines 
used in our analyses. In §4 we present the results for the evolutionary rates for these samples. 
We discuss in §5 the distribution function of the Ly-a lines as a function of redshift and of rest 
equivalent width (hereafter REW). We also briefly discuss the corresponding distribution for Njji 
column densities for an assumed Doppler parameter. We give in §6 an alternative depiction of the 
evolution of the Ly-a lines by binning the data, and discuss the effect of adding lines from the 
single high redshift (z > 1.5) QSO (UM 18, z em = 1.89) for which we have a complete, though 
tentative, set of line identifications. In §7, we ask whether there are differences in the evolutionary 
rate for Ly-a systems having, and not having, detections of lines which can be identified with 
heavy elements and investigate whether there is a dependence of the rate of evolution upon the 
strength of the Ly-a lines. We conclude in §8 with a summary of the results, a comparison of our 
results with the numerical simulations of |Ricdigcr et al. 1998 , some implications of our results 



concerning the nature of absorber-galaxy associations, some speculations on multi-component 
Ly-a models and suggestions for further observations. Some details of the maximum likelihood 
code used in our analysis are described in the Appendix. 



2. The QSO Sample 

In CAT3 we present all previously unpublished spectra taken as part of the HST QSO 
Absorption Line Survey. However, for eight of the QSOs (all with z > 1.0, and including the 
highest redshift objects in the sample), while we were able to measure wavelengths and equivalent 
widths for the absorption lines, we have not yet produced a significantly complete list of line 
identifications based upon the rigorous identification algorithms presented in CAT1, CAT2, and 
CAT3. Because unambiguous identifications become increasingly difficult for the increasing line 
density of the higher redshift objects, and because the decomposition of blended features in a 
unique way also becomes very difficult at these redshifts, especially at the relatively low resolution 
of our spectra (230-270 km s _1 ), it is not clear if it will be possible to produce completely and 
securely identified line lists for this set of objects based upon our current data. For a ninth high 
redshift object, UM 18 (z em = 1.89) we have managed to complete a set of identifications for the 
measured lines, but for the reasons just stated pertaining to the eight objects mentioned above, we 
consider these identifications to be less robust than the identifications of the lines in the remainder 
of the Catalogue. For the purposes of this paper we consider the identifications for UM 18 to be 
tentative (see CAT3 for additional discussion of the line identifications for UM 18). 

For these reasons we have decided to summarize our conclusions concerning the evolutionary 
properties of the Ly-a lines based upon the completely identified line lists published in the 
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Catalog, not including the UM 18 line list for the majority of the analysis. However, we do briefly 
describe the impact of adding this higher redshift quasar on the results. 

Illustrations of all the spectra, the variation of the observed equivalent width of the line 
detection threshold as a function of wavelength, and the detailed line lists are all found in the 
Catalog. Note that in the present paper we only consider lines in the "complete sample" , which 
is defined to be those lines with a significance level, SL, of at least 4.5 ( |Schneider et al. 1993 , 



hereafter Paper II; CAT1; CAT3). The significance level is defined as follows: 

SL - lWl (I) 

where W is the observed equivalent width, and a(W) is the la error in the observed equivalent 
width of an unresolved line. Note that a(W) is calculated with the flux errors at the positions of 
strong absorption lines replaced by the errors interpolated from the surrounding continuum points 
(see Paper II). The significance level differs from the more familiar definition of signal-to-noise 
ratio, SNR = \W\/o~(W), because of the use of the interpolated error array for the calculation 
of SL. 

As noted above, there are several objects in the Catalog for which there are line lists, but not 
secure line identifications. In addition, there are a few objects which have deep broad absorption 
troughs (BALQSOs) and which have been omitted in the analysis of this paper; these objects will 



be described elsewhere ( Turnshek et al. 1998 ). Finally, there are a few objects where the spectral 
coverage was such that no region shortward of the Ly-a emission line redshift was observed. These 
objects too, are of course omitted from the present analysis, as are all the objects which were only 
observed using the low dispersion G160L grating, which has a resolution of ~ 1100 km s — 1 . 

There remain 63 QSOs (not counting UM 18) whose Ly-a lines comprise the data set analyzed 
here. In Table 1 we list these objects in order of emission line redshift. Column 1 lists the common 
name, columns 2 and 3 the 1950 coordinates, column 4 the emission line redshift, and columns 5 
and 6 the lower and upper usable redshifts for detecting a Ly-a absorption line. The lower usable 
redshift is defined by the low wavelength cutoff of the spectral coverage, (except that wavelengths 
shortward of 1218 A were not considered.) The upper usable redshift coverage was set by the 
smaller of the upper wavelength cutoff of the spectral coverage and the wavelength corresponding 
to the "proximity velocity " cutoff, discussed in §3. For Table 1 this cutoff velocity was chosen to 
be 3000 km s _1 . Additional information about each of the QSOs listed in Table 1 can be found in 
Table 1 of CAT3. 



3. Application of Maximum Likelihood Analysis to Line Counts 
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3.1. Is Line Counting Appropriate? 

In this section we describe the use of maximum likelihood estimators to obtain values of three 
parameters which characterize the distribution of the Ly-a absorption lines as a function of REW 
and redshift. From the point of view of the physical properties of the absorbing gas it would, of 
course, be preferable to know the distribution of Doppler widths and H I column densities, but at 
our resolution the lines are not resolved, so profile fitting is not feasible. Although use of the higher 
Lyman lines and curve-of-growth techniques would allow us to infer something about Doppler 
widths and column densities for systems in which the higher Lyman lines are observed, this subset 
of the data is substantially smaller than for Ly-a itself. Moreover, blending of components could 
very well vitiate these results. 

Additionally, at the observed line densities at the highest redshifts for objects in CAT3, not 
only do line identifications become more and more difficult, but, at the FOS resolution (~230-270 
km s _1 ), line blending becomes more and more severe, and the decision about when to decompose 
a given feature into several components becomes increasingly arbitrary — though reproducible — in 
terms of well-defined algorithms and reduction parameters (Paper II). More generally, some 
simulations call into question whether the concept of a "cloud" has much meaning. In this 
sense, it would be worthwhile to ultimately repeat our entire analysis based upon the flux-deficit 



method, as explicated in detail in Press et al. 1993. In both approaches the most difficult and 



least objective procedure still remains the drawing of the continuum. 

Notwithstanding these shortcomings and the fact that we must deal with equivalent widths 
rather than column densities, we take the point of view that simulated spectra can be reduced and 
analyzed using the same techniques as the real data. Direct comparisons between the equivalent 
widths in the real and simulated data can be made and the question of line counting vs. flux 
deficit is thus somewhat moot. 

Finally we have generally not attempted to evaluate the reliability of every line identified 
as Ly-a in the Catalog, but have taken a line to be Ly-a whenever it was the first identification 
appearing in the line lists of the Catalog, independent of any indications of blends or alternative, 
but less probable, identifications. The exception to this statement occurs where candidate C IV 
doublets are identified shortward of the Ly-a emission line and when the expected position of the 
corresponding Ly-a is not within our spectral coverage (or falls where the signal-to-noise ratio 
is very poor) and when there are no other lines identified at the same redshift as the candidate 
C IV doublet. In such cases the algorithms used for the line identifications cannot provide an 
unambiguous identification and we had to make a subjective decision in selecting the preferred 
identification (see §3.4 in CAT3). 

We have carried out simulations in order to estimate the probability that such C IV doublets 
are actually chance pairs of Ly-a lines which satisfy the criteria used by the line identification 
algorithm to accept a C IV doublet, and these simulations are described in detail in CAT2 and 
CAT3. (A similar ambiguity occurs for the O VI doublet, but in practice this has very little 
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impact on the line lists.) 

We have treated these relatively small number (46) of ambiguous C IV doublets which occur 
in the line lists for the objects considered in this paper in 4 different ways: (1) Regard all such 
possible C IV doublets as being two Ly-a lines. We refer to the line list treated in this way as 
the "maximum Ly-a line list" assumption. (2) Regard all such possible C IV doublets as actually 
being C IV. We refer to the line list constructed in this way as the "minimum Ly-a line list" 
assumption. (3) Based upon the probabilities for finding false doublets (see Table 4 of CAT3) 
we assign probabilities to the identifications of being either C IV doublets or pairs of Ly-a lines. 
Then, with a random number generator, assign each of these possible C IV doublets to be either 
two Ly-a lines or a C IV doublet on the basis of these probabilities. We refer to this case as the 
"statistical Ly-a line list". (4) We use the line identification appearing first in the CATALOG and 
refer to this as the "preferred Ly-a line list". 



3.2. Parameters Characterizing Ly-a Line Subsamples 

The maximum likelihood code used in the present paper is based upon the description given 



in the paper by Murdoch et al. 1986 but differs in a few significant ways. The formalism and 
these differences are described in detail in the Appendix. The explicit parameterization adopted 
to describe the evolution is given in equations A-l, A-2, and A-3 of the Appendix. We describe 
there also the determination of the distribution functions in redshift and rest equivalent width and 
the corrections for incompleteness. 

In this section we define various ways of dividing the Ly-a lines into subsamples. The results 
will be presented and discussed in §4. Most of our results are contained and summarized in 
a table (Table 2) in which each row of the table specifies and contains the results for one of 
the subsamples. The columns of Table 2 have the following meaning: Column 1 is the number 
identifying the sample. Columns 2-6 define the particular sample of Ly-a lines: Column 2 gives 
the number of lines in the sample, column 3, (labeled REW) gives both the limiting lower and 
upper rest equivalent widths accepted in that sample. The notation L,M,B in column 4 indicates 
whether the samples included: i) Ly-a lines in which there were no other lines identified with 
heavy elements at or near the redshift of these Ly-a lines (L); (ii) only such systems associated 
with heavy-element containing systems (M), or (iii) both types (B). By "associated" we mean 
that the Ly-a system in question lies within 300 km s _1 of a system (most often the same system) 
containing at least one heavy element ion.f^ 

In column 5 a "V" (for variable sensitivity) indicates that Ly-a lines were accepted regardless 
of the value of the 4.5 a REW detection limit at the wavelength in question (of course the line 



14 Note that the use here of the phrase "associated" is different from that sometimes used to denote absorption 
systems whose redshift is very close to the emission line redshift. 
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itself must exceed this limit to be accepted in the sample). As explained in the Appendix, the 
incompleteness in the "V" sample is corrected for by the fit to the exponential distribution of 
rest equivalent widths. Thus, the validity of the incompleteness correction in the "V" samples 
depends explicitly on the goodness of fit to an exponential in the distribution of rest equivalent 
widths. Alternatively, we also define a "U" sample (for uniform sensitivity), accepting only those 
lines whose REW is greater than some minimum value, but also limiting the spectral coverage 
in which lines are accepted, regardless of their REW, to regions having at least this minimum 
detection limit. In §4 we choose the value of this minimum detection limit in order to maximize 
the number of lines in the large "U" sample. We stress that the purpose of defining such U 
samples is simply to check that the values of 7 we find for the V samples are not strongly affected 
by the assumption of an exponential distribution of rest equivalent widths. For the U samples, 
the determination of the best fit for 7 is independent of any assumption about the distribution 
of equivalent widths above the adopted minimum detection limit for the U sample. However this 
independence is achieved at the cost of significantly decreasing both the number of lines and the 
spectral coverage. Column 6 is the "proximity value" cutoff: Using the emission line redshifts 
given in Table 1, we calculate the wavelength corresponding to a Doppler shift of V prox from the 
emission line redshift, where V prox is taken positive for wavelengths shorter than the Ly-a emission 
line. There are several reasons for treating with caution lines near the emission line redshift 
(which is itself uncertain when it is not based upon lines from the narrow emission line region, as 
is the case for many of the QSOs in Table 1). First, the ionization of the clouds may be influenced 
by the radiation field from the QSO itself, and indeed quantitative analysis of this effect is now 
a standard tool to use in estimating the metagalactic ionizing radiation field, as is discussed 
for the Key Project sample by Lu et al. 1998a. However, there are at least two other equally 
important reasons: (1) In the region surrounding a QSO, there is likely to be enhanced density, 
with an enhanced density of Ly-a lines, mitigating to some extent the ionizing effect of the QSO 
(cf. [Bechtold 19"9^ , [Williger et al. 1994| ) (2) Recent work— cf. articles in a recent PASP Conference 
Series ( parlow et al. 1997 , Hamann et al. 1997, Aldcroft et al. 1997) — clearly demonstrates 
that a significant fraction of such lines — perhaps the majority — are intrinsic to the QSO. Such 
absorption features — originally called "associated absorption" (cf. Foltz et al. 1988] ) — typically 
have very high levels of ionization, and they are seen frequently in O VI and C IV in the key 
project sample (cf. CAT3). Generally, their ejection velocities do not exceed 3000 km s , and for 
this reason we have adopted this value in all the samples in Table 2 except for sample 1. However, 
it is also now clear ( Hamann et al. 1997j ) that there are occasional instances of material ejected 
to very high velocities — presumably related to the Broad Absorption Line QSOs (BALQSOs). In 
any particular instance it is not obvious whether a given absorption feature is truly intervening or 
ejected, and indeed, high ionization troughs in one of the objects in our sample, PG 2302+029, is 
such an instance, as recently discussed by [lannuzi et al. 1996 . 



Columns 7, 8 and 9 contain the results and are, respectively, the line densities at z = 0, the 
slope, 7, and the characteristic rest equivalent width in the exponential distribution, w* . Also 
given are the la uncertainties in these three quantities, calculated according to the algorithm 
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discussed in the Appendix. Note that in column 7, where we give the line density at zero redshift, 
we give the line density for lines above a rest equivalent width of 0.24 A, using the value of w* 
given in column 9. Line densities normalized to any other lower limiting REW can be readily 
calculated by multiplying by exp[—(wu m — 0.24)/w*]. 

4. Results for Several Samples 

In this section we define several subsamples of the Ly-a lines and evaluate the maximum 
likelihood parameters for each. However, as discussed in §3, there is some ambiguity in the 
identification of a subset of lines which could be identified as either pairs of Ly-a lines or a C IV 
doublet. We defined in §3 four possible sets of identifications which we characterized as the 
"minimum", "maximum", "statistical" and "preferred" Ly-a line lists. As described below in 
connection with sample 2, we conclude that the results are relatively insensitive to which of these 
line lists we choose. 

The first sample, sample 1 in Table 2, presents the results using the preferred line list. For 
each of the spectra of the 63 QSOs of Table 1, (except for the Ly-a lines identified in the high 
redshift object UM 18, which are not included and are discussed in §6 below) sample 1 includes all 
Ly-a lines in the preferred list whose absorption redshifts are less than the emission line redshifts 
listed in Table 1. Sample 2 shows the results of eliminating those lines within 3000 km s^ 1 of the 
emission line redshift: because there are relatively few such lines, their inclusion or exclusion does 
not significantly alter the results of these large samples. 

In order to check the sensitivity of our results to the uncertainties in the line identifications 
noted above, we considered the same criteria for line selection used in sample 2, but used the 
minimum, maximum, and statistical line lists. The results for w* and the line density differ only 
very slightly. The results for the minimum line list yield a value of 7 of 0.08, slightly flatter than 
for the preferred list, but by an amount which is only one half of the formal la uncertainty in the 
fit. In the case of the minimum line list, for any given line of sight, Ly-a lines in the preferred 
line list, whose identifications are ambiguous (in the sense of Ly-a pairs vs. a C IV doublet) are 
removed at the low redshift range covered by that line of sight, since the ambiguity is significant 
only when the Ly-a line which would normally be required to confirm the C IV identification is at 
too low a wavelength to be observed at all, or is observed with very low signal to noise. This effect 
therefore tends to steepen 7. However the QSOs in whose lines of sight these ambiguities tend to 
occur most frequently are those at the higher redshifts, and thus, considering the entire sample, it 
is the higher redshift lines which tend to be removed from the preferred sample, which flattens 7. 
For the maximum line list, the results are in the opposite sense and we obtain a 7 of 0.24, again 
differing from the preferred line list value by only one half of the formal la uncertainty in the fit. 
Finally, the "statistical" line list gives a result which is close to, but slightly less than, the 7 for 
the maximum line list. Since all of these line lists yield values of 7 which do not differ by more 
than one half a from the formal uncertainty in the fit of 7, in the remainder of this paper we use 
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the preferred line list. 

In §3 we described the use of a uniform detection limit sample, which avoids any assumption 
about the distribution of rest equivalent widths. Sample 3 differs from sample 2 in utilizing such a 
uniform detection limit sample and shows that use of a uniform sensitivity sample also does not 
substantially alter the results for 7 for these larger samples. For sample 3 we chose the value of 
the minimum REW to be 0.24 A, which maximizes the number of lines in the sample: If we take 
a value larger than this, we expand the wavelength coverage but restrict the number of lines, since 
their density decreases rapidly with increasing REW. Conversely, if we lower this minimum REW 
further, we restrict further the wavelength regions having this more sensitive detection limit, and 
thereby accept fewer lines. In sample 3 we have therefore set the lower limit of the REW for lines 
accepted in the sample to be 0.24 A and thus also to regions of the spectra having at least this 
sensitivity. Since there are significantly fewer lines, and more restricted wavelength coverage, the 
formal errors are larger than in sample 2. 

As discussed in §7 it has been traditional to separate the Ly-a lines into "Lyman-only" 
systems and those containing at least one heavy element line (or "associated" with such a heavy 
element-containing system in the sense defined above). Samples 4 and 5 are for V and U samples 
comprising just the Lyman-only systems. Once again, due to the fact that such lines are in the 
heavy majority, the results do not differ much from sample 2. 

The most striking thing about the results in the first five samples of Table 2 are the very 
low values of 7 which are much lower than was initially expected, on the basis of groundbased 
samples, when the Key Project quasar absorption line survey was conceived. These low values 
were suggested in our preliminary analysis in both CAT1 and CAT2, but are now even smaller, are 
based upon a much larger number of lines, and now extend to redshifts of 1.5. Similar preliminary 
results, based upon spectra of two QSOs of lower resolution and a very much smaller sample 
of lines, were obtained by Impey et al. 199€| . More recently, a preliminary analysis based upon 



archival FOS-HST data has been presented by Dobrzycki fc Bechtold 1998| with similar results. At 



some redshift, however, we certainly expect an upturn in the rate of evolution if these results are 
to be compatible with the groundbased results. We return to this topic in §6. 

The slow rate of evolution we observe seems in conflict with the visual impression resulting 
from inspecting the spectra of the QSOs at z « 0.3 and z « 1.0 (see figures in CAT1, CAT2, 
and CAT3). The z em > 1.0 QSOs certainly appear to have many more strong absorption lines 
blueward of the QSO Ly-a emission line than does, for example, the spectrum of the low redshift 
QSO 3C 273. Three factors contribute to this impression. First, while the spectra obtained in 
the survey are of fairly uniform signal-to-noise ratio per resolution element (about 30) over most 
of the wavelength coverage, this results in a lower rest equivalent width detection limit in the 
spectra of the higher redshift objects than it does in the lower redshift objects. Moreover, redshifts 
from ~0 to about 0.35 are seen only in the G130H spectra, which sometimes have somewhat 
lower S/N than the G190H and G270H data, especially at the lowest redshifts. This results in 
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correspondingly fewer lines at low redshift. Second, there are more heavy element systems detected 
in the spectra of the higher redshift objects (in part because of a larger number per unit redshift, 
but also because there is more observed path length at the higher redshifts) and each of these 
systems can contribute 3 to 20 absorption lines to the spectrum. Finally, the higher order Lyman 
series lines (Ly-/3, Ly-7, etc.) are more readily observable in the spectra of the higher redshift 
QSOs, due to a larger observed path length and the more sensitive rest equivalent width detection 
threshold. All of these effects are evident in the spectrum of the z cm = 0.770 QSO PG 1538+477 
(see CAT3 and figure 3 of Jannuzi 1998]) . 



We illustrate the rest equivalent width detection limits in Figure 1, where we reproduce from 
CAT3 both the observed 4.5a detection limit and the rest 4.5a detection limit for typical spectra 
resulting from the 3 gratings used here. At a redshift of 1.5, even for observed equivalent width 
limits which are comparable, the minimum detectable rest equivalent width is lower by a factor of 
2.5 than it is at zero redshift. Because of the fairly rapid decline in the distribution of Ly-a lines 
with increasing rest equivalent width, these numerous weaker lines are not detectable at the lower 
redshifts. All of these effects combine to give one the false visual impression of a rapidly evolving 
population of Ly-a lines. 



5. The Distribution of Ly-a lines in Redshift, Rest Equivalent Width and Column 

Density 

5.1. The Cumulative Distribution Versus Redshift 

The values of 7 and w* summarized in Table 2 are based upon the assumption that the 
distribution of the Ly-a lines can be reasonably well represented by a power law in (1 + z) and 
an exponential in the rest equivalent width. Since the effective path length in redshift space is 
affected both by the number of lines of sight associated with each grating, the distribution of 
emission line redshifts in our sample, and the variation in minimum detectable equivalent width as 
a function of wavelength for a given grating, there are significant differences between the observed 
redshift distribution of the Ly-a lines and the "true" distribution (i.e., the observed distribution 
corrected for the non-uniform coverage in sensitivity and wavelength.) The same is true for the 
distribution of rest equivalent widths. The observed distribution of lines in redshift and REW is 
shown in Figure 2, where we have also plotted the number of sightlines as a function of redshift. 

In the Appendix, we describe the algorithms used for making these corrections. In displaying 
the results we could, following for example Storrie-Lombardi et al. 1996| , plot the cumulative 



distribution vs. the "effective path length" , in which the cumulative distribution function increases 
by an equal amount for every Ly-a line. Alternatively, we could weight the lines in forming 
the cumulative distribution function, as described in the Appendix, and plot this cumulative 
distribution directly as a function of redshift. The former has the advantage that a statistical 
test for goodness of fit (e.g., the KS test) can be rigorously applied, whereas the latter has the 
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advantage that "redshift" is much more readily interpretable than "effective path length" as the 
abscissa. We opt for the latter case. Where the data are sparse (e.g., between the redshifts of 
0.31 and 0.35 corresponding to the region from 1600 to 1650 A where the S/N is generally very 
poor — see Figure 1) each line is weighted more heavily than in adjacent regions and causes a 
jump in the ordinate of the distribution function which is larger than for a line in a well-sampled 
redshift interval. 

Nevertheless, as shown in Figure 3, it is evident that the power law fit to the data of sample 1 
is a remarkably good one. The application of the KS test to the data displayed in Figure 3, while 
not rigorous for the reasons discussed in the preceding paragraph, yields a probability of 36% 
that a KS statistic > to the observed one would arise from a random sampling of true power law 
distributions with the same slope, thus confirming the visual impression of the goodness of fit. 
Since it is rather unlikely that the evolution of such complex physical processes as those which 
produce Ly-a lines in QSO spectra should be represented accurately by a power law, we do not 
pay particular attention to whether a better representation might be found — as always, our point 
of view is that the empirical (but corrected) distribution function is to be compared with detailed 
numerical simulations. 



5.2. Distribution of Equivalent Widths 

A similar analysis can be carried out for the distribution of equivalent widths. In this case, 
it is more instructive to compare the raw and corrected differential distributions in bins of 0.1 A. 
This is shown in Figure 4. The fit is quite good. There may be a very slight excess of lines in the 
first bin, though this is not statistically significant. The fact that the exponential is a fairly good 
fit follows from the convolution of a power law in the distribution of H I column densities over a 
range of Doppler parameters, as pointed out by [Murdoch et al. 1986 and Press et al. 1992 . 



Since most simulations are presented in terms of the H I column density, however, we make a 
very rough attempt to translate the results of Figure 4 from equivalent width to column density. 
This requires knowledge of the Doppler parameters which we lack. There is to date only limited 
information on the distribution of Doppler parameters at low redshifts, but the existing data 
suggest it is not very different from that found for higher redshift systems. In Figure 5 we convert 
equivalent widths to column densities for a single value of the Doppler parameter of 30 km s _1 , 
though there are clearly instances of lines having both larger as well as smaller values. The 
approximate slope of the differential column density distribution, dlogN / dlogNni is —1.3 at the 
low column density end, consistent with values found from high resolution studies at high redshifts 
(cf. |Kim et al. 1997 ). In Figure 5 we also plot the curve-of-growth for Ly-a relating equivalent 



width and column density for 3 Doppler parameters (20, 30, and 40 km s _1 ), and from these 
curves the influence of the Doppler parameter upon the inferred column density distribution can 
be estimated. 
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Thus we conclude that the large samples are quite well represented by a power law in redshift 
and an exponential in rest equivalent width. There remains, however, the question of whether the 
rate of evolution might vary as a function of line strength or metallicity: we defer discussion of 
this question until §7. 



6. Is There Evidence for an Upturn in the Rate of Evolution with Increasing 

Redshift? 

As noted in §4, at some point we should expect that an upturn should exist in the rate of 
evolution of the Ly-a lines deduced from the Key Project data if they are to match smoothly onto 
the results obtained from groundbased spectroscopy. There was some slight hint of this from the 
data based upon the smaller and lower redshift samples comprising CAT1 and CAT2. However, 
when we bin the data from the substantially larger database represented by sample 2, there is no 
convincing evidence for this up to a redshift of 1.5, as shown in Figure 6. In Figure 6 we have 
divided the redshift space between and 1.5 into five bins spaced equally in log(l + z). Within 
each bin, we evaluated dN/dz at the middle of that bin, fixing both 7 and w* to have the values 
for the global fit of sample 2. (The resulting values of dN/dz are very insensitive to the value of 
7 assumed since the values of dN/dz plotted in Figure 6 are centered between the limits of each 
bin.) Figure 6 confirms the impression of a very flat rate of evolution over the range z = 0.0 to 
1.5 suggested by Table 2. The same procedure using the U sample (sample 5) gives very similar 



results. In Figure 6 we have also plotted groundbased results from an early analysis by Lu et al 



1991| and more recent results by |Bechtold 1994| , based upon data which is similar to ours in terms 
of line density per resolution element in the region where the groundbased and HST data join. The 
slope found in that analysis was 1.85. (A recent analysis based upon much higher resolution and 
signal to noise spectra using Ly-a lines above redshift two, and whose strengths are comparable 



to those in our sample yielded a steeper slope, 7 = 2.78; Kim et al. 1997 ). Taken at face value, 
Figure 6 suggests that in the redshift range from about 1.3 to 1.7, which bridges the gap between 
our FOS data and the lowest redshifts available from the groundbased data, the rate of evolution 
increases rather abruptly. 



6.1. The Addition of UM 18 to the Sample 

While both the character of the data and the analysis in the Bechtold data are not very 
different from our HST data and analysis, it would be preferable to have a uniform data set and 
analysis which spans this redshift range. In particular, adding Ly-a lines in the redshift range 
between z ~ 1.5 and z ~ 1.7 from the Key Project objects is very important, as it will fill in the 
gap between our currently well-identified sample of absorbers and the large groundbased samples 
that have been compiled. 
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After the bulk of the analysis for this paper had been performed, identifications of the lines 
in the spectrum of one of these objects, UM 18 (z cm = 1.89) were completed (CAT3). Since line 
blending and the location of the continuum start to pose significant problems in objects at these 
redshifts, at the FOS G270H resolution, we must regard the line lists from UM18 as considerably 
more uncertain than those for the other 63 QSOs we have discussed. Therefore, we have chosen 
not to include UM 18 in most of the analysis. We have, however, run a sample identical to 
sample 2 in which all the UM 18 Ly-a lines are included. Although UM 18 has a large number of 
lines at redshifts below z=1.5, the overwhelming number below this redshift still come from other 
lines of sight, and the inclusion or non-inclusion of UM 18 lines at redshifts below this redshift 
makes no significant difference in either the global fits in Table 2 or the binned data of Figure 6. 

However, when we extend the binned data to include the bin from 1.5 to 1.69 (corresponding 
to 3270 A, the red end of the G270H spectral coverage) then the 19 lines contributed by UM 18 in 
this region suggest a sharp upturn in the rate of evolution. The bin spanning this redshift range 
and containing these 19 lines is also shown in Figure 6. Because of the small number of lines in 
this bin the upturn in the line density from what extrapolation of the data up to redshift 1.5 
suggests is only significant at about the 1.5<r level. While it is thus somewhat reassuring that 
there is a moderately strong indication from UM 18 that we are starting to see this upturn in our 
Key Project data, it will be very important to see if this upturn is indeed as sharp as suggested 
from the current data, as the remainder of the large number of lines in the CAT3 line lists become 
reliably identified. Evidently, the redshift regime between about 1.5 and 1.8 is a very interesting 
one for further study with both HST and with groundbased telescopes. 



7. Does Evolution Rate Depend on Metallicity or Line Strength? 

Given the wide range of physical conditions and processes which can give rise to observable 
Ly-a absorption, it would be somewhat surprising if the rate of evolution did not depend to some 
extent on various properties of the Ly-a absorption such as degree of association with galaxies, 
metal content, and line strength. In this section we discuss the latter two possibilities. 



7.1. Does Evolution Rate Depend Upon the Presence of Heavy Elements? 

It has been customary to treat the Ly-a absorption systems in which no other ions are found 
separately from those in which ions from heavy elements are detected. Historically, this arose 
from the original suggestion of [Sargent et al. 1980 that the low column density Lyman-only 
systems represented unprocessed material and formed a population distinct from absorption 
systems containing, e.g. C IV. This idea was supported by the apparently quite different clustering 
properties of the two types of systems. 



The advent of very high resolution, high sensitivity groundbased spectra has cast some doubt 
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on this point of view, since C IV systems seem to generally be found at H I column densities 
down to ~ 3 x 10 14 (Cowie et al. 1995) below which it has been presumed that the C IV lines 



become too weak to detect individually. It has further been suggested ( Cowie et al. 1995| ) that 



there is a "floor" in the metallicity of the Ly-a forest lines of order 10 -2 solar, reflecting perhaps 
general contamination from the first epoch of star formation. On the basis of the coaddition of a 
large number of weaker Lyman lines (in the H I column density range 10 13 ' 5 — 10 14 ) though, it 
was found that the ratio of C IV to Ly-a strengths become very much smaller, either because of 
very much lower (or zero) metallicity, or conceivably because of an abrupt change in the level of 



ionization ( Lu et al. 1998b ). However, using a different technique, Cowie and Songaila ( 1998 ) find 
that the metallicity (as judged by the C IV/Ly-a line ratios) does not drop at very low column 
densities (and thus in regions of nearly zero over-density) . 

The foregoing all refers, of course, to much higher redshift systems than in our HST sample, 
and, in view of the rather unexpected flatness of the rate of evolution of the total ensemble of 
Ly-a lines (as reflected in sample 2 of Table 2), it is of interest to ask whether there is any evidence 
that the Ly-a systems in which one or more ions of heavy elements have been detected behave 
any differently than the Lyman-only systems, insofar as their rate of evolution is concerned. 
We stress that with the sensitivity and limited spectral coverage of our FOS data, the lack of 
detection of lines from heavy elements does not imply that these systems truly lack heavy elements 
or are even strongly metal deficient. Most, perhaps all, may well be found to contain weak 
heavy element lines with STIS and/or groundbased resolution and sensitivity. Thus, although we 
will continue to refer to this set as "Lyman-only" systems, this should be understood to mean 
"systems with no identified heavy element ions due to inadequate sensitivity or spectral coverage, 
or low heavy element abundance." The much smaller set of Ly-a lines for which heavy element 
ions have been identified at the same redshift we designate as LWHED lines (Ly-a Lines With 
Heavy Element Detections). We also include in this category the very few Ly-a lines without 
heavy element ions identified at the Ly-a redshift, but whose redshift is within 300 km s _1 of 
heavy element-containing systems, since it seems very likely that such systems are physically 
associated with the heavy element-containing systems. In defining the LWHED samples it is 
also crucial to exclude lines within 3000 km s _1 of the emission line redshift, since, as remarked 
previously, systems showing C IV and O VI near the emission line redshift are quite common 
and have a high probability of being intrinsic to the QSO. In our total sample of Ly-a lines with 
Vp rox = 3000 km s _1 , there are a total of 78 systems in which one or more heavy element ions 
have been identified. There are an additional five "associated" Ly-a lines (as defined in §3.2) for 
a total of 83 which we place in the LWHED category. Among these 83, however, are some found 
in the possible BALQSO PG 2302+029 which has broad weak troughs of O VI, N V, C IV and 



probably Ly-a and is described in detail in Jannuzi et al. 199£ . Whether these troughs represent 



intervening or ejected material is not known, but if the troughs are interpreted as high velocity 
ejected material, then, since sharp components are often found to accompany the broad troughs, 
for the purpose of the following discussion we have dropped PG 2302+029 from the samples 
discussed in this section. There remain 79 LWHED systems; the five "associated" systems and 
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74 Ly-a lines having redshifts identical, within the measuring errors, to the redshifts defined by 
the heavy element lines themselves. Although comparison of the large "V" and "U" samples 2 
and 3 give quite comparable results, the smaller samples become increasingly vulnerable to the 
assumption of exponential fits to the equivalent width distribution. This is especially true of the 
LWHED systems, where the e-folding REW, w*, is much larger than in the other samples. In 
samples 6 and 7 therefore, we show the results for both the "V" and "U" assumptions for the 
LWHED lines. In the latter case we have chosen the minimum equivalent width cutoff to maximize 
the number of lines, and this occurs at a rest equivalent width of 0.40 A. We find a very much 
more rapid rate of evolution (7 ~ 1.2 — 1.6) for these two samples, respectively, than for the total 
sample (e.g., sample 27 = 0.16). Because of the small number of lines the uncertainty of 7 for 
the LWHED samples is now substantially larger. The median REW and characteristic equivalent 
width, w*, are also both very much larger than the full Ly-a sample. 

Because the median REWs of sample 2 are significantly smaller than that of the LWHED 
sample 6, we do not necessarily expect to detect heavy element lines in many of the weaker 
Ly-a lines of sample 2, given the FOS sensitivity, even if there were no significant differences in 
metallicity. One might expect therefore, that the difference in rates of evolution between sample 6 
vs. sample 2 simply reflects differing rates of evolution between low and high equivalent width 
samples, and has very little to do with metallicity. 

It would not be unexpected that in a subset of the Lyman-only lines drawn from the high 
end of the equivalent width distribution of the Lyman-only sample, we would obtain a rate of 
evolution which was comparable to the LWHED samples, and we might then attribute the lack 
of detected heavy element lines in such a sample to, for example, the expected position of C IV 
falling outside our wavelength coverage. 

It is not possible to find a sample of high REW Lyman-only systems which exactly matches 
the equivalent width distribution of the LWHED systems and still have a sample of reasonable 
size. However, considering just lines in the Lyman-only sample with REW > 0.70 A (samples 8 
and 9 for the V and U samples, respectively) we may compare the 0.25, 0.50 and 0.75 percentiles 
in the REW distributions with the following results: For samples 6 and 8 (the V samples for the 
LWHED and Lyman-only samples) the REWs for these three percentiles are 0.48, 0.80 and 1.09 
vs. 0.74, 0.83 and 0.99, respectively, while the corresponding comparisons for the U samples 7 
and 9 are 0.63, 0.89 and 1.15 for the LWHED systems vs. 0.74, 0.82 and 0.99 for the Lyman-only 
systems. 

Comparison of the maximum likelihood values for 7 in sample 6 vs. sample 8 as well as 
sample 7 vs. sample 9 shows that the LWHED systems seem to be evolving much more rapidly 
than the Lyman-only systems, though the errors are large due to the small number of lines. 
Utilizing the formal 1<t errors given in Table 2, we find formal differences in the rate of evolution 
of the Lyman-only and LWHED systems exist at about the 2.2a and 2.4<r level for the V and 
U samples, respectively. 
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Recognizing that the apparent differences in rates of evolution between the LWHED systems 
and the high end of the REW distribution for the Lyman-only systems are not highly statistically 
significant it is nevertheless of interest to consider whether selection effects might artificially give 
rise to this apparent difference. Or, if the difference is real, what the likely interpretation is. Since 
a detailed discussion of the properties of the heavy element systems in the Key Project sample 



will be presented in a separate paper ( Sargent et al. 1998 ), we will give only a brief and mostly 
qualitative discussion. 

There is one effect which might cause systems which we have considered to be LWHED 
systems to actually be Lyman-only systems: Accidental groupings of real Ly-a lines might lead to 
false identifications of metal line systems satisfying all the requirements of the line identification 
algorithm. As discussed in §3, and in detail in CAT2 and CAT3, when the candidate C IV doublet 
is in the Ly-a forest, and when the Ly-a line which would be present at the same redshift as the 
candidate C IV is beyond our spectral coverage, then of order 50% of these candidate C IV systems 
may be spurious. Substituting pairs of Ly-a lines for the C IV identifications thus increases by 
a few percent the total number of Ly-a lines in the total Ly-a sample (and in the Lyman-only 
sample), but, as discussed in §4, has only a very small effect on the maximum likelihood fit 
parameters. Altering the identifications of such cases does not change the composition of the 
LWHED sample, since, by definition, both heavy element ions and Ly-a must be present. However, 
the discussions in CAT2 and CAT3 also demonstrate that when a candidate C IV system is found 
and when the Ly-a in the same system is accessible, i.e., when it is a candidate LWHED system, 
then the number of false identifications per spectrum is very small. Thus, given the total number 
of LWHED systems in samples 6 and 7, it is unlikely that misidentification of accidental spacings 
of a very small number of Ly-a lines as heavy element lines significantly affects our results. 
Conversely, one might also suppose that the lack of heavy element detections in samples 8 and 9 
simply reflects biases in the spectral coverage, and that with much broader spectral coverage, but 
the same sensitivity, nearly all the Lyman-only systems of samples 8 and 9 would have heavy 
element detections. 

Granting that this could be the case however, we would still be faced with the question of 
why the LWHED and these Lyman-only systems seem to have different evolutionary rates, and 
whether the bias introduced by the incomplete coverage is responsible for this difference. 

The following argument suggests that biases in spectral coverage are not likely the explanation 
for the effect we observe. In the range of rest wavelengths accessible with the gratings used in our 
sample, the C IV doublet is the most ubiquitous ion found, as is well known from groundbased 
studies. The other most prominent lines (or doublets) are, in order of decreasing rest wavelength: 
the Si IV doublet (AA1393,1402), C II A1334, Si II A1260, the N V doublet (AA1238,1242), 
Si III A1206, the O VI doublet (AA1032,1037) and C III A977. 

If all of these ions were roughly equally likely to occur, then the lack of spectral coverage 
would have little effect on the values of 7 in samples 8 and 9. The Si II A1260 line, for example, 
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representative of low ionization systems, is nearly always accessible when Ly-a is itself. The same 
is true for Si III A1206, typical of intermediate ionization systems. The lack of coverage which 
would lead to the inability to detect the high ionization C IV doublet for Ly-a redshifts above 
about 1.1 would be roughly compensated for by the ability to detect O VI for Ly-a redshifts above 
about 0.65, for objects in which G190H data exist, and above about 1.2 for those objects in which 
only G270H data exist. 

However, to the extent that C IV is the most frequently occurring and readily detectable heavy 
element ion, extending the spectral coverage to wavelengths beyond A3270 (corresponding to C IV 
redshifts above 1.1) would preferentially remove systems at high redshifts (z > 1.1) in the putative 
Lyman-only systems and transfer them to the LWHED systems. This would tend to steepen 
the LWHED evolutionary rate and flatten the Lyman-only rate. To make a semi-quantitative 
estimate of the magnitude of this latter effect, we define one additional sample of Lyman-only 
lines (sample 10, Table 2) which has precisely the same sensitivity limits and path length coverage 
as the LWHED sample 7. From these two samples we calculate the fraction of Ly-a lines in which 
the C IV ion is seen in those cases in which it was accessible (about 12%). For every Ly-a line in 
sample 10 for which the redshift is such that C IV was not accessible we used a random number 
generator and this fraction of 12% to produce a set of "virtual line lists" in which we imagined 
that the C IV line was sometimes detected in instances where it was not accessible in the real 
line lists. The average values of 7 resulting from this procedure for what we might call "virtual 
samples 7 and 10" (but not shown in Table 2) are 1.95 and 0.16 respectively, (with uncertainties 
in 7 about the same as the real values) compared to the actual values of 1.55 and 0.25. The effect 
is in the anticipated direction — increasing the spectral coverage would preferentially discover 
high redshift C IV lines, and move high redshift systems from the Lyman-only category into the 
LWHED category, thus increasing the difference in the two gammas. We find that the difference 
in the 7's between these two virtual samples is significant at about the 2.8a level. 

One additional possibility which might give rise to the different rates of evolution found for 
samples 6 vs. 8 or 7 vs. 9 would arise if the high ionization systems which cluster around the 
emission redshifts, most of which appear to be intrinsic to the QSO, have a significant high-velocity 
tail which extends well beyond 3000 km s _1 . Removing such systems would preferentially remove 
the highest redshift systems along each individual line of sight and thus lower the 7 of the LWHED 
systems. Analysis of the actual distribution in "ejection velocity" space shows that this is not 
likely to be an important effect: In the 5000 km s _1 bin between —2000 and 3000 km s _1 (the 
"in-falling" systems probably represent the difference between the true systemic velocity and that 
inferred from the broad emission lines) there are 19 LWHED systems. In the next two 5000 km s _1 
bins (from 3000 to 8000 and 8000 to 13,000 km s _1 ) there are 4 and 5 such systems respectively, 
and the numbers remain at about this background level for the higher velocity bins. Evidently, 
there is no very significant high velocity tail which could likely give rise to the possibility just 
described. 



While the foregoing is hardly a rigorous analysis, it seems unlikely that effects primarily 
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attributable to incomplete spectral coverage, line misidentification, or a high velocity tail of 
intrinsic systems can account for the LWHED rate of evolution being steeper than those exhibited 
by the Lyman-only systems. Thus, if one is willing to accept these results as statistically 
significant, what is the most likely explanation for this? It is of course possible that the LWHED 
and high- RE W Lyman-only systems really represent differences in metal content. Only by 
improved sensitivity, higher resolution, larger samples and extended spectral coverage can this be 
confirmed. A program which can and should be readily carried out is to obtain the appropriate 
groundbased data in order to extend the data base for the C IV doublets which actually do 
accompany the Ly-a lines in our samples 8, 9 and 10 above redshifts of 1.1. 

We believe a more likely explanation for the difference in the evolutionary rates between the 
LWHED and Lyman-only systems than metallicity differences, is simply that there is a significant 
difference in H I column densities between the LWHED systems and the Lyman-only systems of 
samples 8 and 9, despite our attempt to match the two samples in terms of the distribution of 
equivalent widths. As examination of the curves-of-growth in Figure 5 makes clear, for the entire 
range of equivalent widths represented by nearly all the Ly-a lines in samples 6-10, the equivalent 
widths give virtually no reliable information at all about the H I column density. This insensitivity 
is worsened by blends which cause blended lower column density systems to masquerade as very 
high column density single systems. The LWHED systems probably simply act as markers to 
pick out those systems with H I column densities large enough for the heavy element ions to be 
detected at our FOS sensitivities. We would then conclude that over the 2-3 orders of magnitude 
range in H I column densities represented by samples 6-10 there is a systematic increase in the 
rate of evolution with increasing H I column densities. Support for this view comes from the rate 
of evolution of the Lyman limit systems which occur fairly frequently among the stronger LWHED 
systems having several different ion species (CAT3). The Lyman limit systems were found by 
ptengler-Larrea et al. 199~5 to have values of 7 in the range 1.0-1.5, depending upon the particular 



sample analyzed. 



7.2. Does the Rate of Evolution of the Ly-a Lines Vary with Ly-a REW? 

Despite the insensitivity of the REW to H I column density for all but our weakest lines, 
we can attempt to examine whether there is any evidence within our sample for a dependence 
of the rate of evolution upon rest equivalent width. In view of our conclusion above concerning 
the reason for the difference in evolution rate between the Lyman-only and LWHED systems, we 
ignore the distinction between the Lyman-only and LWHED systems and divide up the 920 Ly-a 
lines in sample 2 into five REW bins, whose boundaries are chosen so that for the five samples 
each bin contains the same number (184) of Ly-a lines. This leads to REW bin boundaries (in A) 
of 0.06, 0.22, 0.32, 0.45, 0.64 and 00. (There are no lines detected with REW less than 0.06 A in 
our sample.) Except for the highest two REW bins, the spread in rest equivalent width is not 
large enough to allow the parameter w* to be well-determined for the individual bins. We have 
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therefore fixed the value of w* to the value found in sample 2 (0.273). The resulting values of 7 
depend only weakly on the actual value of w* selected, as long as this value does not fall much 
below about 0.20. For the higher equivalent width bins, the samples are virtually identical for 
the U and V samples, but for the lower REW bins both the sample size and redshift range drop 
strongly for the U samples (the U sample has no meaning for the lowest REW bin, since there 
is virtually no coverage over which the minimum detectable REW is everywhere < 0.06 A.) We 
thus consider only V samples. We estimate the errors in 7 due to the formal uncertainty in the 
fit, together with the sensitivity to the assumed fixed value of w* , to amount to about ±0.50. The 
resultant values of 7 vs. rest equivalent width are shown in Figure 7. Figure 7 is suggestive of an 
increase in the rate of evolution with increasing equivalent width, in accordance with expectations 
if our preferred explanation for the difference in the LWHED and Lyman-only systems is correct, 
though the trend exhibited by the first 4 bins is not continued for the highest REW bin. This 
result is qualitatively in accord with the analysis of Dobrzycki & Bechtold 1998 as well as the 
analysis of Kim et al. 1997 at high redshifts based upon high resolution data. 



8. Summary and Discussion 

8.1. Summary of Results 

Using the homogeneous sample of HST-FOS spectra obtained during the QSO Absorption 
Line Key Project, and the set of lines measured and identified using objective and reproducible 
procedures, we have analyzed a subset of 63 of these spectra to investigate the evolution of the 
Ly-a lines in the sample. The subset of these objects and associated line lists are those objects 
excluding the strong broad absorption line QSOs, QSOs whose spectral coverage did not cover the 
region blueward of Ly-a, QSOs whose line identifications are not yet complete, and those QSOs or 
portions of QSO spectra that were only observed using the low dispersion G160L grating, having 
a resolution of ~ 1100 km s . 

Our main result is that over the redshift range from to 1.5, the rate of evolution, as 
parameterized by 7 in the expression (1 + z) 7 , is very much flatter than found from groundbased 
data which involve redshifts higher than about 1.7 and for which estimates in the literature have 



varied over a wide range of values. In particular, the analysis by [Bechtold 1994] yielded a value of 
7 of 1.85 (see Figure 6). The fits to the groundbased data and our data intersect at a redshift of 
about 1.5-1.6. 

The values of 7 we find (e.g., 0.16 for sample 2) are in general smaller than what is expected 
for clouds whose product of (comoving) number density and cross-section is constant, as might 
be expected for, e.g., stable clouds in galaxy disks or halos. For the currently popular value of 
ft ~ 0.2, under this assumption the value of 7 varies only slightly, from 0.9 at zero redshift to 
about 0.85 at a redshift of 1.5. Our data thus implies that the overall ensemble of Ly-a absorbers 
evolve so that this product becomes smaller with increasing redshift. 
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The fit to a power law in (1 + z) for the larger samples is quite good, as is the fit for the 
distribution of rest equivalent widths by an exponential. 

We find two exceptions to the flat rate of evolution described above: 

1) When we examine the Ly-a lines from the single high redshift QSO, UM 18, for which 
complete line identifications have been made, the line density in the redshift bin between z = 1.5 
and z = 1.7 apparently shows a strong upturn in the rate of evolution (though the identifications 
are still somewhat uncertain and there are only 19 lines in this bin). 

2) Samples consisting of Ly-a lines which have ions of the heavy elements identified at 
the same redshift as the Ly-a lines, or which are within 300 km s _1 of the redshift of systems 
containing such ions (the LWHED samples 6 and 7) show a much faster rate of evolution than the 
large total samples (e.g., sample 2) or the Lyman-only samples (e.g., samples 4 and 8) even when 
a subset of the Lyman-only systems crudely matching the rest equivalent width distribution of the 
LWHED set are considered. Discussion of selection effects suggests that line misidentification and 
incomplete spectral coverage is probably not responsible for this difference. In fact, simulations 
suggest that the difference between the LWHED samples and the Lyman-only samples increases 
when account is taken of the incomplete spectral coverage of our data. 

We find evidence that the rate of evolution increases with increasing REW. 



8.2. Discussion 

As noted in the Introduction, there have been significant advances in the detail with which 
simulations of the properties of the Ly-a absorption lines can now be carried out. It is the case, 
though, that high resolution simulations incorporating small (galaxy-sized) scales, high column 
densities, and shocks, and requiring the incorporation of both hydrodynamic and many-particle 
gravitational processes, have largely been confined to redshifts above about 2.0, well above the 
redshift regime of our data, and are just now being extended to redshifts of 0.0. 

There is a general consensus, though, that if one avoids the smallest scales, and higher column 
densities, then the baryons trace rather well the collisionless particles in the cold dark matter 
scenarios. Riediger et al. 1998| (hereafter RPM) took advantage of this to carry out simulations 



using a many-body code to obtain results to z = 0.0, allowing for shock heating in an approximate 
way. We therefore limit comparison of our results to this paper, in which comprehensive references 
to other recent simulations may be found J 1 ^ 

By also modeling the evolution of the UV background radiation, which obviously plays a 



15 We have recently learned that a full hydrodynamic simulation to zero redshift has been carried out by Dave et al. 
1998. This work also agrees with our result in finding a decrease in the rate of evolution at redshifts of roughly 1.5, 
as well as agreeing qualitatively with our result that the weaker lines evolve more slowly than the stronger ones. 
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major role in determining the HI column density, their simulations yield the evolution of line 
density with redshift (for lines with log Njji > 10 14 ) — see their Figure 5 — which is quite similar to 
our Figure 6. (For a Doppler parameter of 30 km s _1 , this column density implies a rest equivalent 
width of about 0.25 A — see our Figure 5). Since the thermal history of the gas plays an important 
role in the simulations, which is in turn affected by whether the gas has been shocked or not, RPM 
distinguish two populations of absorbers ("P u " and "P s "), in which material in the first population 
has not been shocked, and material in the second population has been. (In reality, there is no 
doubt a continuum of properties rather than a sharp dichotomy between two such populations.) 

The RPM simulations indicate that the unshocked material is found primarily in under-dense 
regions, and in the more outlying regions of condensations, while the shocked population is found 
in the vicinity of regions undergoing condensations to clusters and individual galaxies. RPM 
further suggest that it is among the shocked population that metal enrichment will be found. 
Their simulations show that the unshocked population evolves the more rapidly with increasing 
redshift and thus dominates the absorbers at high redshifts, while it is the shocked population that 
has the flat evolution and dominates at low redshifts (see their Figure 6). Finally, they suggest 
that the very weakest lines evolve much more slowly than the stronger lines (their Figure 7). Their 
models are thus in qualitative agreement with our results, but the column densities they associate 
with their shocked population and which they predict to dominate at low redshifts are smaller 
than we can detect with our FOS data. While their association of the metal-enriched lines with 
the slowly evolving shocked population might seem in apparent contradiction with our result that 
those Ly-a systems with detectable heavy elements evolve more rapidly than those without, the 
minimum degree of metal enrichment which these authors envisage is well below our threshold for 
detection. 

Regardless of whether the division of absorbers into "shocked" and "unshocked" populations 
is realistic, both the apparent rapid evolution of the Ly-a lines above a redshift of ~ 1.5, and 
the difference in rates of evolution of the Lyman-only and LWHED systems suggest that rather 
different physical processes and/or environments are involved, and these different processes and 
environments would be expected to also affect the question of the association of Ly-a absorbers 
with galaxies. 

The dependence of the rate of evolution upon the Ly-a line strength suggested by our Figure 7 
raises interesting questions about the nature of the relationship between the Ly-a absorbers and 
galaxies, and in particular about the nature of this relationship for the weaker lines. The relation 
between galaxies and low redshift Ly-a absorbers has been investigated by a large number of 
investigators over a number of years. For example, |Bahcall 197S suggested that moderate sized 



galaxy halos could account for metal-containing absorption systems while larger halos could 
account for the larger number of Ly-a lines. Subsequent to the launch of HST, the relation 
between galaxies and low redshift Ly-a lines has been examined quantitatively by a number of 
groups. In a recent extensive analyses, phen et al. 1998 examined the cross-correlation statistics 



between Ly-a absorbers and galaxies for Ly-a absorbers with equivalent widths greater than 0.3 A. 
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They find that the cross section for producing absorbers at a given equivalent width increases 
with the B-band luminosity of the galaxy (but is not strongly dependent upon morphological 
type) and that for a given luminosity there is a strong anti-correlation between equivalent width 
and impact parameter. They also find that the absorption properties show little or no correlation 
with redshift. They conclude that a significant fraction — at least 50% — of all such absorbers 
are physically associated with outer gaseous envelopes of galaxies extending to ~160 kpc. They 
further suggest that very low luminosity galaxies — whose properties and spatial density are not 
yet well-determined — could account for the remaining fraction. 

Our fits for the value of 7 of our large samples (e.g., sample 2) are significantly less than 
expected for objects whose product of comoving number density times cross section is constant, 
and which, as noted above, for the currently popular value of f2 ~ 0.2, implies a value of 7 of about 
0.9. The assumption of a constant product of comoving number density times cross section is, in 
fact, implicit in the evaluation by |Chen et al. 1998 of the expected Ly-a line density — see their 



equation (29)-and is supported by the absence of any significant correlation of galaxy-absorber 
properties with redshift in their analysis. 

Our conclusion in §7.2 that the rate of evolution depends upon line strength, and in §5 that 
the product of number density times effective cross section decreases with increasing redshift for 
the sample as a whole, leads to at least one of the following three conclusions: 

(1) The association between Ly-a absorbers and galaxies at the lower end of our equivalent 



width distribution is not as tight as implied by the |Chen et al. 1998| relation. Inspection of 
contours of constant H I column density which have been published for higher redshifts suggests 
this is likely to be the case, and for the weakest low redshift Ly-a lines the notion of well-defined 
"galaxy halos" is probably not very meaningful. 

(2) Over the redshift and equivalent width ranges of our sample, there is a more complex and 
redshift-dependent relation between line strength, galaxy luminosity and impact parameter than 
that found by |Chen et al. "1998 over the more limited range of line strength and redshift covered 



in their analysis. 

(3) The types of galaxies associated with a significant number of the Ly-a lines evolve in a 
very different way from "ordinary" galaxies. In particular, the rate of evolution we deduce for the 
weaker lines in our sample is different from the population of blue dwarfs, (which |Chen et al. 1998 



suggest may account for a significant fraction of the Ly-a lines), whose rate of evolution is more 
rapid than that of typical luminous galaxies. 

Conclusion 1) would be consistent with the failure to find a strong galaxy-Ly-a absorber 
association among the very low column density absorbers (cf. [Morris et al. 1993|) . Only more 
detailed studies of the statistics of galaxy-Ly-a association extending to weaker Ly-a lines, higher 
redshifts, and fainter galaxies can definitively settle these issues. 



We suggest that the LWHED systems involve moderate H I column density clouds, higher 
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than those associated with the weaker Lyman-only systems, but generally not as high as the 
low-ionization and Lyman limit systems, which comprise a relatively small fraction of the LWHED 
sample. These latter systems may arise in clouds pressure-confined in thermalized halos, as 
initially suggested by Bahcall & Spitzer 196£; investigated observationally by, e.g. Bergeron & 
Boisse 1991 and Steidel et al. 199~i| ; and discussed in detail theoretically by Mo &; Miralda-Escude 



1996, These absorbers may have column densities which are too high and scales which are too 



small and too closely associated with galaxies to be well-reproduced in the RPM simulations. 

The foregoing, together with the results of §7.2, imply that at low redshifts, high H I column 
density clouds evolve more rapidly than those of low H I column density, just as they do at 
higher redshifts ( Kim et al. 1997 ). However, regardless of what combination of the three scenarios 
enumerated above proves most nearly correct, it still leaves unanswered the question of why the 
line density evolves as it does as a function of redshift and line strength. The variation of the 
ionizing background radiation with redshift obviously plays an important role, but cannot be the 
sole factor, since such variations would cause the H I column densities to scale in the same way. 
However, local evolving ionizing sources could bring about differing rates of evolution by affecting 
more strongly the nearer and higher H I column density clouds. 

Our conclusions and speculations need to be, and can be, investigated more carefully using 
high resolution and high sensitivity data. In particular, at the redshifts dealt with in this paper, 
a data set taken with, e.g. STIS, at a resolution of ~ 15 km s _1 would avoid almost entirely 
ambiguities associated with line blending. A rigorous assessment of the effects of line blending at 
the FOS resolution would require detailed simulations of spectra which cover the full range of line 
strengths and density of lines encountered in our data set, and which utilize the relatively small 
amount of existing high resolution HST data. Such simulations would then need to be analyzed 
using exactly the same procedures we have used for the actual data. A few such simulations and 
analyses have been carried out (Paper II), but not at sufficiently high line densities. Lacking this, 
we give only the following qualitative and semi-quantitative comments: 

At low redshifts (e.g., below 0.5) the observed line densities in our spectra are so low that 
blending of lines from the mean background is not a significant problem, nor is the effect of line 
crowding in affecting the placing of the continuum. For example, in the spectrum of PKS 0405—12, 
the mean separation between the 4.5<r lines in our sample is about 15 times the spectral resolution, 
so blending by random superposition of lines rarely occurs. It could nevertheless be the case that 
even isolated features break up into multiple components if there is substantial velocity structure 
on scales less than our resolution. Resolving such structure would of course affect the derived 
equivalent width distribution, but unless the amplitude of such structure varied with redshift it 
would not affect the discussions of the rate of evolution. In any case, the rather sparse amount of 
high resolution data on the very low redshift Ly-a lines suggests that this is not likely to be an 
important effect. 



At the upper end of our redshift range (~1.5), the ratio of the mean line separation to the 
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spectral resolution has decreased to about five, for the reasons discussed in §4. Thus, quite 
aside from clustering, we should expect that of order 20% of our 4.5a lines would be affected by 
blending. Depending upon the detailed structure of such blends, the deblending algorithm may 
or may not break up such blends into the correct number of components, and the assignment 
of the individual equivalent widths to the components is dependent upon the detailed structure 
of the blend. The effect of blending upon the rate of evolution of the Ly-a lines is subtle and 
complex. Naively, one would expect that blending would decrease the number of lines compared 
to the number measured with higher resolution but the same limiting equivalent width. However, 
resolving the blends in such a situation also has the opposite effect, since the decreased equivalent 
width of resolved blends would cause many of the lines now in our sample to move below the 4.5<r 
level. In addition to the affects of line blending per se crowded spectra make the placement of the 
continuum increasingly subjective. 

Because of these uncertainties, in our comparison of the HST results with ground-based 



results, we have chosen to place more emphasis on the Lu et al. 1991 and Bechtold 1994 results, 
rather than higher resolution ground-based data (e.g., Kim et al. 1997) since the former two 
studies were carried out with spectral resolution and line densities more nearly comparable to 
ours. 

Finally, the foregoing discussion warrants strongly re-emphasizing the remarks made in §3.1: 
In our view, the value of our analysis and data set will be most fully realized via comparison 
between the simulated spectra of detailed hydrodynamic numerical simulations just now reaching 
zero redshift. Such simulated spectra can be modeled to have exactly the resolution and sensitivity 
characteristics of our data, and by analyzing them in precisely the same way that our FOS data set 
has been analyzed, most of the problems above will be circumvented. 

In addition to largely avoiding problems caused by line crowding, if the nature of the Ly-a 
absorbers really can be described roughly in terms of two components — or perhaps three as 
suggested above — but more realistically in terms of a continuum of environmental conditions, 
then a high resolution, high sensitivity data set would allow exploration of other properties in 
addition to metallicity and rate of evolution which would likely reflect these differing conditions. 
These would include the correlation of the Ly-a absorbers with galaxies or, more generally, the 
amount of local over- or under-density, and the clustering among the lines themselves, as well as 
their Doppler and equivalent width distributions. In CAT2 we have already presented evidence 
indicating that some Ly-a lines tend to cluster around strong heavy element-containing systems. 

The redshift regime around 1.4—1.8 appears to be an extremely interesting transition region 
insofar as the Ly-a absorbers are concerned, and efforts to study correlations in these various 
properties in this redshift regime should be carried out. With the new generation of large 
groundbased telescopes, and careful attention to the design of the spectrograph and detectors, 
Ly-a lines down to redshifts as low as ~ 1.55 can probably be studied more effectively than 
from STIS, but for redshifts below that we will have to rely on STIS or the Cosmic Origins 
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Spectrograph. 

RJW, BTJ, and LL thank R.F. Carswell, M. Rauch, L. Storrie-Lombardi and D. Weinberg, 
and RJW thanks L. Cowie and E. Hu, for helpful discussions and comments on early drafts of the 
manuscript. 
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Appendix 

We use maximum-likelihood estimation to obtain values of the parameters characterizing the 
evolution of the Ly-a lines. A detailed description of this formalism as applied to the evolution of 
the Ly-a forest was given by Murdoch et al. 1986| . Our formalism follows in general that presented 



by [Murdoch et al. 1986| but differs in a few important details, so we summarize here the method 
used in the present paper. 

We parameterize the distribution function for the Ly-a lines by: 

d 2 N 



dzdw 



A x G{z) x H(w) (A - 1) 



where A is a normalization factor and we assume a simple power law distribution for G(z), and an 
exponential distribution for H: 



G(z) = {l + zji (A -2) 

and 

H(z) = e~ w l w * (A - 3) 

where z is the redshift and w is the rest equivalent width (REW). In §5 we discuss whether these 
simple assumptions are an adequate description of the distribution function. The extension of the 
formalism described here to more complex expressions for G(z) and H(w) is straightforward but 
for the present data set they seem adequate. 

In order to explicitly obtain error estimates for the normalization constant, A, we modify the 
Murdoch et al. 1986| formulation of the maximum likelihood estimation so that the normalization 
factor A, i.e., the line density at some fixed redshift, (generally chosen to be zero) and integrated 
over some range of REWs, appears explicitly as a parameter to be determined along with, e.g., 7 
and w* . 

We imagine redshift-REW space to be divided up into a large number of cells, small enough 
that we may always neglect the probability of two or more lines being found in any one cell. If 
A n is the expected number of lines to be found in a cell at redshift z and REW w, then the 
probability of the cell having no lines in it is -Pempty = e~^ n while the probability of a cell being 
"full" (i.e., having one line in it) is Pf u \\ = e~ An x An. 

Then the probability of the ensemble of the observations yielding a set of lines filling the 
particular cells (z\, wi) , (Z2, W2) ... is 
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P = ]"'[ e- An x J] (An x e' An ) (A - 4) 

empty— cells full— cells 

The quantity to be minimized is the "entropy" , S = — In P which can be written 

S = -hxP= An ~ 12 ln ( An ) (A -5) 

all— cells full— cells 

The expected number of lines, An, in any given cell whose center is at z,w is given by 

An = Ax G(z) x H(w) x Az x Aw (A- 6) 

We can replace the first sum in equation A- 5 as an integral over all the accessible regions of 
redshift and REW space. Retaining only the terms which involve the unknown parameters, one 
obtains for the function to be minimized 



S = A 12 G{z,j,...)A(z,w*,...)b(z) dz- Y,MAxG(z t )xH(w z )) (A - 7) 

k = l • )z low,k j = l 



where Z[ om ^ and z upp ^ are the usable redshift limits for the k — th QSO (see Table 1), the first 
sum is over all the QSOs in the sample, and the second sum is over all the lines in the sample. 
The quantity A is defined as 

A = / H(w) dw (A - 8) 

provided wn m < wz, and is zero otherwise, and where w m i n is the greater of [wi , wn m ]. 
The two values of the REW, w\ and W2, define any particular sample of lines satisfying 

< W\ < W < W2 < oo. 

The function wn m (z) is the REW at the wavelength corresponding to Ly-a at redshift z that 
yields a 4.5cr detection for the QSO in question. These functions are taken from the Catalog. 

The quantity b(z) is the "blocking factor" defined in CAT1. Note that the normalization 
factor A appears explicitly as an unknown factor with an associated uncertainty, in contrast to 
the formalism in Murdoch et al. The normalization constant A, along with 7, and w* are found 
by finding their values for which S is minimized. In the case of the normalization constant, this 
can be done explicitly and setting dS/dA = 0, and solving for A leads to 



i N N \ 9 

" " i:rr k k G(z, 1 ,...)A(z, w *...)b(z) dz = oW^ 
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where Q, the effective path length, is defined by equation A-9. Equation A-9 is an obvious 
result, but this formulation leads to formal estimates in the uncertainty in A and the coupling 
between its uncertainty and the uncertainty in the other parameters, as described below. Note 
that the effective path length, Q, takes into account the variation in signal-to-noise as a function 
of wavelength for each individual spectrum, and couples the equations determining 7 and w* 
in a complex way. This expression for the path length also depends explicitly, of course, on 
the assumption of the distribution of rest equivalent widths being adequately described by an 
exponential, leading to variable REW detection limit ("V") samples. To avoid this assumption 
we can restrict the spectral regions which contribute to the path length and to the lines in the 
sample to those regions having a detection limit which is everywhere more sensitive than some 
specified value, and consider only lines above this value (uniform detection limit, or "U" samples). 
As described in the text, the results for the two cases are generally quite similar for large samples. 

The expression for A in equation A-9 can then be substituted into equation A- 7 and values 
for 7 and w* found. 

We have found it most convenient to solve for these remaining parameters by 
using the simplex minimization routine AMOEBA as given in Press et al. 19"8G| . 

Error Estimation 

In this section we describe the method we have used for estimating the uncertainties in the fitting 
parameters, but also describe two other slightly different approaches, and compare the three 
methods to get some indication of the reliability of the error estimates. 

In general, if P(A,w* ,7) is the (normalized) probability distribution for the 3 fitting 
parameters and (for example) 

/•CO 

P'( 1 )= P(A,w*,-y) dAdw* (-4-10) 
Jo 

is the marginal distribution of 7, if we have no a priori knowledge of the other parameters, then 
lower and upper confidence limits (7^ and j up for 7) are given by the expressions 

= r° p'b) ^ (a ~ no) 

J — 00 



and 



c up = r p p'(rr) d 1 {A -lib) 

j —00 



whereas the dispersion in the value of 7 is, by definition, 

r+00 

a 2 (7) = / P'( 7 ) x (7 - 7 avg) 2 dj (A - 12) 
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For a large number of lines in the sample, and for a large range in redshift we expect the 
"entropy" to have a sharp minimum around the most probable values and to rise steeply around 
this minimum, so that expanding the entropy function around the most probable value and keeping 
only 2nd order terms should be a good approximation. In this case the probability function is 
Gaussian with the surfaces of constant probability being ellipsoids. The la confidence limits, with 
C\ = 0.1587 and C up = 0.8413, then yield the same estimates as the expression for the dispersion 
in equation A-12. 

In general, the principal axes of the error ellipsoid do not coincide with the A, 7 and w* axes 
(i.e., there are non-zero mixed 2nd derivatives in the expansion of the entropy function) and the 
integral in A-12 is most conveniently evaluated by transforming to the principle axes and then 
carrying out the appropriate integrals. This method of estimating the errors in the maximum 
likelihood function and using the Gaussian approximation is the one used for the uncertainties 
quoted in Table 2 and we refer to this method as "method 1". 

For relatively small samples, the Gaussian approximation becomes poorer, and the surfaces 
of constant probabilities typically take on a "banana shape" (cf. Storrie-Lombardi et al. 1996| ). 
As in Storrie-Lombardi et al. , estimates for the uncertainty in the values of the fitting parameters 
have sometimes been made by locating the entropy contour for which S — S m i n = 0.5 and taking 
e.g., 07, to be the extremes of 7 defined by this contour. We denote this method by "method 2". 

When the Gaussian approximation is not used, the marginal distribution function is not 
symmetric about the most probable value and the average of the marginal distribution function is 
not the most probable value. An alternative to method 2 is to use equations A-lla and A-llb to 
evaluate the confidence limits. This method is somewhat preferable to method 2 since it uses the 
distribution of S over the entire parameter space. Rather than evaluating the integrals A-lla and 
A-llb numerically over the probability function based upon a single realization — e.g., the fixed 
(observed) number of lines along each line of sight, and the fixed observed distribution of those 
lines in redshift-equivalent width space — we regard the values determined from the maximum 
likelihood estimation as representing some true underlying distribution. We then use Monte Carlo 
methods to: (i) sample the number of lines along any sightline from a Poisson distribution whose 
expectation value is that appropriate for the sensitivity and spectral coverage of that sightline (ii) 
select the equivalent width and redshift for the lines along this sightline from the underlying 7 
and w* distributions (iii) for each realization, use the maximum likelihood code to give values of 
A, 7 and w* . The ensemble of realizations then populates empirically the density of points in A, 
7, w* space. One may then simply sort the resultant values of, e.g., 7 from all the trials, and 
by inspection determine the values corresponding to the two confidence limits. This procedure 
probably comes closest to giving a true estimate of the uncertainties in the parameter estimates 
and also has the advantage that it serves as a check on the code. We refer to this procedure as 
"method 3" . 



In practice, for the full data set discussed in this paper, and for the determination of all 
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three parameters, method 3 requires a prohibitive amount of computation, and even method 2 is 
rather cumbersome. In order to compare these three methods therefore, we have considered a very 
simple case involving a single line of sight, with a uniform detection limit, so that only the A and 
7 variables are coupled. 

Specifically, imagine a line of sight spanning the redshift range z = 0.0 to z = 1.5 with an 
underlying value of 7 = 1.0, and consider three cases for which the expected number of lines is 
10, 100 and 1000. For methods 1 and 2 we take the sum of ln(l + Zj) (see equations A— 2, A-6 
and A— 7) over the lines in the sample to be the mean expected for 7 = 1.0. The results for these 
three methods for estimating the uncertainty in 7 are summarized in Table A-l. For method 1, 
the values of a + and a~ are equal and the uncertainty goes exactly as l/\/N, but for methods 2 
and 3 the upper and lower confidence limits are slightly different and the scaling of the confidence 
limits as l/y/N is only approximate. Inspection of Table A-l shows, however, that for the sample 
sizes dealt with in this paper, the method we have used should give results which are accurate to 
within a few percent. 

The "True" Distribution Functions in Redshift and Rest Equivalent Width 

If the minimum detectable equivalent width at every redshift were uniform, and if there 
were an equal number of sightlines at every redshift, then the raw distribution functions would 
represent a fair sample of the true distributions of lines as a function of equivalent width and 
redshift. In general, neither of these conditions is satisfied. As mentioned in the text, we can 
calculate cumulative distribution functions in terms of "effective redshift path lengths" (as in 



Storrie-Lombardi et al. 1996 ) and "effective equivalent width path lengths", in which case every 
detected line increments the empirical distribution function by an equal amount. In practice, 
however, these path lengths are highly non-linear functions of redshift and equivalent width. We 
have therefore elected to weight each line to compensate for the incompleteness in the redshift and 
equivalent width coverage. 

We define a "corrected cumulative distribution function" for the redshift distribution (for the 
"V" sample) by 

C(z) = z < zi (A - 13a) 

C{z) = 1 z > z N (A- 136) 

k 

C(z) = VV'Oj) z k <z < z k+1 (A - 13c) 

i=i 

where Zj is the j th redshift and there are N lines in the sample, and where 

^( Z j) = eX P[- W dct,m(zj)] (A - 14) 



In equation A-14 the sum is over all the QSOs whose spectra include the wavelength 
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corresponding to a Ly-a line of redshift z, and Wdet,m is the 4.5a detection REW at that 
wavelength for the m — th QSO. (The normalization to 1.0 is performed after the initial sum is 
calculated.) 

For the corrected equivalent width distribution the analogous expressions are given by 



C(w) = 
C(w) = 1 

k 

CM = ]Ti/AK) 

where A(vjj) is given by 



w < w\ (A — 15a) 

w > wn (A — 15b) 

Wk < w < ttjfc+i (A — 15c) 



Hwj) = £ / 0(^et,m(A), Wj) X (1 + z)T dA (A - 16) 

and where \ m in,m and \ ma x,m are the usable minimum and maximum wavelengths for each QSO 
(corresponding to the Ly-a redshifts of Table 1) and 4>(wdet,m{^)-,Wj) is 1 for Wj > Wd e t,m and 
otherwise. 

These formulations give a reasonable representation of the "true" distribution functions, but 
give large weights to lines found in sparsely sampled portions of redshift or equivalent width space 
(Figure 2). This can be seen, for example in the jump in the cumulative redshift distribution 
function (Figure 3) in the region around redshifts from about 0.31 to 0.36, where the coverage 
from the G130H grating stops and the region from 1600 to 1650 A in the G190H grating has very 
low sensitivity. 
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Table 1. Objects From the HST Quasar Absorption Line Survey Included in the Current 

Analysis 



Object 


RA 


B1950 


Dec 




^beg 


^end 


3C 273 


12 26 


33.3 + 


02 19 43 


0.158 


0.0020 


0.1464 


PG 1116+215 


11 16 


30.2 + 


21 35 43 


0.177 


0.0020 


0.1652 


PG 0953+415 


09 53 


48.2 + 


41 29 40 


0.239 


0.0020 


0.2266 


pi/c i ono 1 no 

rJ\o loUz — lUz 


1 ^ no 
lo Uz 


oo. y — 


in 17 17 
1U 1 i Li 


n os^ 

U.ZOO 


n nnon 
u.uu/u 


n 07*3 1 
U.z / ol 


H 1821+643 


18 21 


41.9 + 


64 19 01 


0.297 


0.0020 


0.2840 


3C 249.1 


11 00 


27.4 + 


77 15 09 


0.311 


0.0020 


0.2979 


PKS 2251+11 


22 51 


40.6 + 


11 20 39 


0.323 


0.0020 


0.3098 


TON 28 


10 01 


10.7 + 


29 10 09 


0.329 


0.0020 


0.3157 


r\j IzlD+UDy 


1*) it; 
1Z 10 


4 / .0 + 


C\« ^^17 
UD 00 1 i 


n iia 


n nnon 
u.uu/u 


n ^on^ 


PG 1049-005 


10 49 


18.0 - 


00 35 21 


0.357 


0.3005 


0.3434 


HS 0624+6907 


06 24 


35.0 + 


69 07 03 


0.370 


0.0020 


0.3563 


B2 1512+37 


15 12 


47.0 + 


37 01 55 


0.370 


0.2943 


0.3563 


3C 351.0 


17 04 


03.5 + 


60 48 31 


0.371 


0.0020 


0.3573 




UU 4o 


1 O « _L 

1Z.0 + 


n*3 ^/i m 

Uo 04 Ul 


n ^Q/i 
U.oo4 


u./you 


n ^7no 


PKS 0003+15 


00 03 


25.1 + 


15 53 07 


0.450 


0.0020 


0.4355 


B2 0742+31 


07 42 


30.8 + 


31 50 16 


0.462 


0.3001 


0.4474 


PG 1259+593 


12 59 


08.2 + 


59 18 15 


0.472 


0.0020 


0.4573 


PKS 2128-12 


21 28 


52.8 - 


12 20 21 


0.501 


0.3002 


0.4860 


1 1 ^n_i_i n^v 


1 1 ^n 


00. u -+- 


1 1 ns 

11 Uo 08 


n ^1 

U.01 


n ^nnn 

U.oUUU 


n a O/i o 
u.4y4y 


PKS 2300-683 


23 00 


27.9 - 


68 23 47 


0.512 


0.3001 


0.4969 


US 1867 


08 50 


13.4 + 


44 00 24 


0.513 


0.2936 


0.4979 


3C 232 


09 55 


25.5 + 


32 38 23 


0.533 


0.3167 


0.5177 


NGC 2841 UB3 


09 16 


30.0 + 


51 18 53 


0.553 


0.3000 


0.5375 


PKS 1136-13 


11 36 


38.5 - 


13 34 05 


0.554 


0.3003 


0.5385 


PG 1333+176 


13 33 


36.8 + 


17 40 31 


0.554 


0.3002 


0.5385 


3C 334.0 


16 18 


07.3 + 


17 43 31 


0.555 


0.3001 


0.5395 


PKS 0405-12 


04 05 


27.4 - 


12 19 32 


0.574 


0.0020 


0.5583 


PKS 0439-433 


04 39 


42.7 - 


43 19 25 


0.593 


0.3001 


0.5771 


4C 41.21 


10 07 


26.1 + 


41 47 26 


0.613 


0.3001 


0.5969 


3C 95 


03 49 


09.5 - 


14 38 06 


0.614 


0.2936 


0.5979 


PKS 0044+030 


00 44 


31.4 + 


03 03 33 


0.624 


0.2948 


0.6078 


PKS 2243-123 


22 43 


39.8 - 


12 22 40 


0.630 


0.3002 


0.6137 


MC 1104+167 


11 04 


36.6 + 


16 44 16 


0.634 


0.3002 


0.6177 


3C 263 


11 37 


09.4 + 


66 04 27 


0.652 


0.2939 


0.6355 


PKS 0637-75 


06 37 


23.5 - 


75 13 38 


0.654 


0.3001 


0.6375 
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Table 1 — Continued 



Object 


RA 


B1950 


Dec 




^beg 


•'end 


3C 57 


01 59 


30.4 - 


11 47 00 


0.670 


0.3000 


0.6533 


PKS 2344+09 


23 44 


03.9 + 


09 14 06 


0.677 


0.3002 


0.6602 


T"> O O 9 £ O 9 /I O 

rKa 2602 — 34/ 


26 52 


50. / — 


9/1 1/1 9 O 

34 14 38 


0. (02 


0.3000 


n a o £ n 
0.6850 


PKS 1354+19 


13 54 


42.2 + 


19 33 44 


0.720 


3002 


0.7028 


0959+68W1 


09 59 


09.7 + 


68 27 48 


0.773 


0.3001 


0.7553 


3C 110 


04 14 


49.2 - 


06 01 05 


0.773 


0.3001 


0.7553 


PG 1538+477 


15 38 


00.9 + 


47 45 10 


0.770 


0.3000 


0.7553 


T"> O 1/10/1 11 

rKa 1424—11 


1/1 O A 

14 24 


56.0 — 


11 £ n o £ 
11 50 25 


n one 
0.805 


n 9 nn 9 
0.3003 


0. (869 


3C 454 3 


22 51 


29.6 + 


15 52 54 


0.859 


2936 


0.8404 


PKS 1252+11 


12 52 


07.7 + 


11 57 21 


0.870 


0.3001 


0.8513 


PKS 2340-036 


23 40 


22.6 - 


03 39 06 


0.896 


0.3001 


0.8770 


PG 1407+265 


14 07 


07.8 + 


26 32 31 


0.94* 


0.3002 


0.9246 


rKa 2145+06 


1 /I £ 

21 45 


36.2 + 


f\d AO A 1 

06 43 41 


0.990 


0.2936 


0.9/01 


TON 153 


13 17 


34.3 + 


27 43 52 


1.022 


2998 


1.0018 


PG 1248+401 


12 48 


26.6 + 


40 07 59 


1.030 


0.3001 


1.0097 




23 02 


12.2 + 


02 55 34 


1 052 

_1_ . \J'.J Zj 


3002 


1 031 5 


PKS 0122-00 


01 22 


55.3 - 


00 21 32 


1.070 


0.2998 


1.0493 


NAB 0024+22 


00 24 


38.5 + 


22 25 23 


1.118 


0.3001 


1.0968 


PG 1352+011 


13 52 


25.5 + 


01 06 51 


1.121 


0.2950 


1.0998 


PG 1338+416 


13 38 


52.1 + 


41 38 22 


1.219 


0.8291 


1.1968 


4C 06.41 


10 38 


40.9 + 


06 25 59 


1.270 


0.8288 


1.2473 


PG 1241+176 


12 41 


41.0 + 


17 37 29 


1.273 


0.8289 


1.2503 


PG 1008+133 


10 08 


29.8 + 


13 19 01 


1.287 


0.8302 


1.2641 


PG 1634+706 


16 34 


51.7 + 


70 37 38 


1.334 


0.8284 


1.3107 


B2011257+34 


12 57 


26.6 + 


34 39 32 


1.375 


0.8296 


1.3512 


PKS 0232-04 


02 32 


36.6 - 


04 15 11 


1.434 


0.8287 


1.4097 


PG 0117+213 


01 17 


34.7 + 


21 18 03 


1.493 


0.8288 


1.4681 


PKS 0743-67 


07 43 


22.3 - 


67 19 08 


1.51 


0.8281 


1.4849 



a The value of z en( j in the table has been calculated using a proximity value cutoff of 3000 km s 1 as 
discussed in §3. 

b The redshift of PG 1407+265 is quite uncertain. See the discussion in section 4 of CAT3. 
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TABLE 2 











Results 


OF THE 


Maximum 


Likelihood Fits 








(1) 


N 
(2) 




REW 


B,L,M a 
(4) 


u/v b 

(5) 


A prox 

(6) 


dN/dz c 


7 




W* (A) 


(lo) 


(3) (hi) 




(7) 








(9) 


1 


987 


0.0 


oo 


B 


V 





35.6 


± 3.2 


±0.11 


±0.16 


0.272 


±0.008 


2 


920 


0.0 


oo 


B 


V 


3000 


34.7 


± 3.2 


±0.16 


±0.16 


0.273 


±0.009 


3 


493 


0.24 


oo 


B 


u 


3000 


32.7 


± 4.2 


±0.26 


±0.22 


0.283 


±0.012 


4 


837 


0.0 


oo 


L 


V 


3000 


33.6 


± 3.3 


-0.03 


±0.17 


0.235 


±0.008 


5 


465 


0.24 


oo 


L 


u 


3000 


30.7 


± 4.2 


±0.15 


±0.23 


0.237 


±0.011 


6 


79 


0.0 


oo 


M 


V 


3000 


2.0 


± 0.6 


±1.24 


±0.55 


0.678 


±0.075 


7 


67 


0.40 


oo 


M 


u 


3000 


2.1 


± 0.8 


±1.55 


±0.61 


0.551 


±0.067 


8 


105 


0.70 


oo 


L 


V 


3000 


31.3 


±10.8 


-0.32 


±0.47 


0.265 


±0.026 


9 


103 


0.70 


oo 


L 


u 


3000 


33.9 


±12.3 


-0.32 


±0.47 


0.265 


±0.026 


10 


340 


0.40 


oo 


L 


u 


3000 


33.0 


± 5.6 


±0.16 


±0.26 


0.215 


±0.017 



a B= All Ly-a lines included independant of the presence of associated heavy element (metal line) absorption; L= Only 
Ly-a lines with no metal lines within 300 km s _1 were included; M= Only Ly-a lines associated with metal lines (< 300 
km s -1 ) were included. 

b U= Uniform Detection Limit sample; V=Variable Detection Limit sample; see text. 

c The value of dN/dz is evaluated at z = 0, and is normalized to refer to lines having REW> 0.24A. 



TABLE A-l 
Comparison of Error Estimates for j a 



Method 


N= 


10 


N= 


100 


N= 


1000 


1 

2 
3 


1.295 
1.259 
1.233 


1.295 
1.345 
1.527 


0.4096 
0.4140 
0.3961 


0.4096 
0.4055 
0.4366 


0.1295 
0.1291 
0.1284 


0.1295 
0.1300 
0.1289 


a Calculated for ue 


derlying 


7 = LO, 


^min — 0.0, 


^max — 


:1.5 
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Wavelength (A) 

1500 2000 2500 3000 




Redshift 

Fig. 1. — Typical 4.5ct equivalent width detection limits for the three gratings (G130H, G190H, G270H) used in 
this study. The usable portion of G130H covers the wavelength range from about 1220 to 1600 A, while the usable 
ranges of the G190H and G270H gratings are about 1650 to 2300 A, and 2300 to 3270 A. The dotted line is the 
observed equivalent width limit and the solid line is the rest equivalent width limit. 
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Redsfnf't (z) 

Fig. 2. — The observed distribution of the 987 Ly-a lines of sample 1 as a function of redshift and rest equivalent 
width. The histogram gives the number of lines of sight in the entire Catalogue as a function of redshift. 
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1.6 



Fig. 3. — The cumulative distribution function for the lines of sample 1 vs. redshift. The solid line is the cumulative 
distribution corrected for incompleteness, as described in the Appendix. The dashed line is the maximum likelihood 
fit to a power law in (1 + z), while the dotted line represents the raw data. 
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5- 



0.3 0.6 0.9 

Rest Equivalent Width (A) 



Fig. 4. — The (differential) binned distribution (log 10 dN/dw) of lines vs. rest equivalent width for sample 1, for 
nine intervals of width 0.1 A between 0.1 and 1.0 A. Open symbols represent the raw data, while the closed symbols 
are the data corrected for incompleteness, as described in the Appendix. The error bars for the corrected distribution 
reflect the root N statistics. The dashed line is the maximum likelihood fit to an exponential distribution. The scale 
for the ordinate indicates only relative, not absolute, numbers, and the raw data have been shifted relative to the 
corrected data for clarity. 
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11 12 13 14 15 16 17 18 19 
Log (N(U I)) 

Fig. 5. — Three Ly -q curves of growth for values of the Doppler parameter of 40 (dashed line), 30 (solid line) and 
20 (dotted line) km s _1 , (right scale) and an approximate conversion of the plot in Figure 4 from equivalent width 
to H I column density. The solid symbols represent the transformation of the binned corrected distribution in rest 
equivalent width of Figure 4 to log Nhi column density, assuming a Doppler parameter of 30 km s _1 . The lefthand 
scale for the ordinate indicates only relative, not absolute, numbers. The extreme sensitivity of the inferred H I 
column density for given equivalent width to the Doppler parameter means that the last several points at the high 
column density of the Figure are not to be taken literally. The approximate slope log (dN / dN(R I)) for the weakest 
lines is about -1.3. See the text for further discussion. 
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i30 

o 



■ HST Sample 2 

▲ UM 18 Bin 

* Lu et al. 1991 

O Deohtold 1994 




Normalized for Rest W = 0.24 



Log (1 + z) 

Fig. 6. — The log of the Ly-a line density per unit redshift for sample 2, evaluated at the midpoint of five bins equally 
spaced in log(l + z) from z = to z — 1.5 (solid squares), and for the Ly-a lines in UM 18 in the redshift interval from 
1.5 to 1.7 (triangle). The line through the solid squares is the best fit power law for sample 2. The lighter symbols 
are taken from groundbased surveys of Lu et al. 1991 (*'s) and Bechtold 1994 (open circles). The normalization for 
the groundbased data has been increased by a factor exp(0.36 — 0.24)/0.276 to adjust the normalization from 0.36 to 
the 0.24 A used for the FOS data. The line through the groundbased data has a slope of 1.85, the best fit found by 



Bechtold 1994 
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0.3 0.6 0.9 

Rest Equivalent Width (A) 



Fig. 7. — The dependence of 7 upon rest equivalent width for the lines in sample 2. The error bars are estimates 
which incorporate both the formal fitting errors and the uncertainties associated with the assumed value of w* in 
carrying out the fits. See the text for further discussion. 



